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ABSTRACT 

In all organisms, aminoacyl tRNA synthetases cova- 
lently attach amino acids to their cognate tRNAs. 
Many eukaryotic tRNA synthetases have acquired 
appended domains, whose origin, structure and 
function are poorly understood. The N-terminal 
appended domain (NTD) of glutaminyl-tRNA 
synthetase (GlnRS) is intriguing since GlnRS is pri- 
marily a eukaryotic enzyme, whereas in other 
kingdoms Gln-tRNA Gln is primarily synthesized 
by first forming Glu-tRNA Gln , followed by conver- 
sion to Gln-tRNA Gln by a tRNA-dependent amido- 
transferase. We report a functional and structural 
analysis of the NTD of Saccharomyces cerevisiae 
GlnRS, Gln4. Yeast mutants lacking the NTD 
exhibit growth defects, and Gln4 lacking the NTD 
has reduced complementarity for tRNA Gln and glu- 
tamine. The 187-amino acid Gln4 NTD, crystallized 
and solved at 2.3 A resolution, consists of two 
subdomains, each exhibiting an extraordinary struc- 
tural resemblance to adjacent tRNA specificity- 
determining domains in the GatB subunit of 
the GatCAB amidotransferase, which forms 
Gln-tRNA Gln . These subdomains are connected by 
an apparent hinge comprised of conserved 
residues. Mutation of these amino acids produces 
Gln4 variants with reduced affinity for tRNA Gln , 



consistent with a hinge-closing mechanism 
proposed for GatB recognition of tRNA. Our 
results suggest a possible origin and function of 
the NTD that would link the phylogenetically 
diverse mechanisms of Gln-tRNA Gln synthesis. 

INTRODUCTION 

Aminoacyl tRNA synthetases perform a critical function 
in conversion of the genetic code into amino acids by co- 
valently attaching the correct amino acid to specific 
cognate tRNAs (1,2). These enzymes are divided into 
two structural classes, each arising from a common 
ancestor (3,4), and catalyze aminoacyl-tRNA formation 
by a two-step pathway: (i) an activated aminoacyl adenyl- 
ate is first formed from ATP and the cognate amino acid; 
(ii) the amino acid is transferred to its cognate tRNA with 
release of AMP. Each synthetase nearly perfectly selects 
the correct tRNA among 20-22 different isoacceptor 
tRNA families (5) as well as the correct amino acid sub- 
strate; in some cases, this is achieved via the use of hydro- 
lytic editing mechanisms to clear misactivated amino 
acid and/or misacylated tRNA (3,4). It is of particular 
interest that tRNA Gln and tRNA Asn are aminoacylated 
by distinct mechanisms in different kingdoms. For 
example, whereas Gln-tRNA Gln is formed in the canonical 
manner in the eukaryotic cytoplasm, all archaea, many 
bacteria and eukaryotic organelles possess an alternative 
two-step pathway. In this route, a non-discriminating 
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GluRS first misaminoacylates tRNA ; next, the 
Glu-tRNA Gln is converted to Gln-tRNA Gln by a 
tRNA-dependent amidotransferase belonging to either 
the GatCAB family (bacteria and some archaea), or the 
GatDE family (archaea only) (6-8). Thus, glutaminyl- 
tRNA synthetase (GlnRS) is primarily a eukaryotic 
enzyme. Synthesis of cysteinyl-tRNA Cys in methanogens 
and highly related archaea provides another example of a 
two-step pathway to cognate aminoacyl-tRNA, although 
the phylogenetic distribution of this pathway is much 
more limited (9). 

Eukaryotic tRNA synthetases are distinctly more 
complex than their prokaryotic homologs because they 
have progressively acquired and retained additional 
domains throughout evolution (1,2). It is perplexing 
why tRNA synthetases, unlike other eukaryotic proteins, 
have been subject to massive progressive additions over 
the course of evolution (2). While some appended 
domains are shared among synthetase families and are 
similar to domains in other proteins implicated in either 
nucleic acid binding or protein-protein interactions (1), at 
least eight domains are uniquely associated with a single 
synthetase family, and neither their structures nor their 
roles are generally understood (2). An exception is the 
CTD of human CysRS, which is known to enhance anti- 
codon discrimination at the expense of the aminoacylation 
rate, acting as a quality control step (10). This report 
focuses on the NTD of GlnRS, which is itself unique 
because GlnRS likely originated in eukaryotes, evolving 
directly from a progenitor eukaryotic non-discriminating 
GluRS (11,12). Like other eukaryotic GlnRS species, 
Saccharomyces cerevisiae Gln4 contains both a highly 
conserved C-terminal domain (CTD) with all of the 
known features of class I synthetases, as well as a less 
conserved appended N-terminal domain (NTD) with no 
obvious sequence homology to any known protein 
domain. 

The origin and function of the NTD in GlnRS are of 
particular interest. Most eukaryotic GlnRS proteins have 
an appended NTD, whereas the bacterial GlnRS proteins 
do not, although the bacterial proteins were almost cer- 
tainly acquired by horizontal transfer from eukaryotes. 
Saccharomyces cerevisiae GlnRS contains both a 
595-amino acid CTD that contains the signature 
elements of a type I synthetase (4,13-15), and suffices 
for both catalytic function and yeast viability (16,17), 
and a 224-amino acid NTD that is uniquely associated 
with GlnRS in many eukaryotes (2). Although both 
Escherichia coli and Deinococcus radiodurans GlnRS 
proteins share extensive identity with the conserved 
5*. cerevisiae GlnRS CTD, E. coli GlnRS entirely lacks 
an NTD (13) and D. radiodurans GlnRS has an unrelated 
domain appended to the C-terminus of the conserved 
domain (14). Two observations imply that the S. cerevisiae 
NTD contributes to synthetase function: the NTD alone 
exhibits a non-specific RNA binding activity (18), and the 
addition of the NTD to iicGlnRS results in a chimeric 
protein that can replace the native yeast gene (19). 
However, the precise role of the NTD in eukaryotic 
GlnRS function is unknown. 



MATERIALS AND METHODS 

Genetic analysis of gln4 mutants 

To construct a strain (MEM70) of genotype gln4-A::kan R 
[CEN URA3 GLN4], a CEN GLN4 plasmid was trans- 
formed into yeast strain BY4741, and then the 
gln4-AKan allele was introduced by transformation, 
using PCR primers HWI P239 and HWI P234 
(Supplementary Table SI) to amplify the fragment from 
the appropriate GLN4/ gln4-AKan heterozygous diploid 
(Open Biosystems ID 22424). To construct strains 
bearing an integrated copy of either GLN4 or gln4-A2- 
210, we used an integrating cassette (20) that carries 
MET15 flanked by sequences homologous to ADE2, 
into which we inserted GLN4 or the gln4( 21 1-809) allele 
(constructed with a synthetic fragment made by Geneart). 
Plasmids were then digested with Stu I to release the 
integrating cassette and transformed into MEM70, and 
transformants were screened for Ade~, and plated 
on FOA to select for removal of the CEN URA3 GLN4 
plasmid, generating the desired gln4-A::kan R ade2~\: 
GLN4::MET15 (MEM133) and gln4-A::kan R ade2~:: 
gln4(211-809)::MET15 (MEM141) strains. To test for 
growth phenotypes, MEM133 and MEM141 were trans- 
formed with a [CEN LEU2 GLN4] or a control [CEN 
LEU2] vector, grown overnight in SD-Leu media (21), 
diluted to OD 60 o of 1 and 2ul of 10-fold serial dilutions 
were spotted onto plates containing either YPD or YP 
glycerol and incubated at the indicated temperatures for 
1-7 days with similarly spotted control parent strains that 
were grown in YPD media. Oligonucleotides, yeast strains 
and plasmids used in these studies are reported in 
Supplementary Tables S1-S3. 

Protein expression and purification 

To express high levels of GLN4 and its derivatives in yeast, 
ORFs were cloned under Pgali control into the previously 
described 2 u URA3 LIC vectors BG2483 or BG2663, in 
which ORFs are expressed with their C termini fused to a 
complex tag containing a 3C protease site, followed by an 
HA epitope, His 6 , and the ZZ domain of protein A (22), 
and expressed in yeast strain BCY123 (23). Gln4(l-187) 
was expressed in yeast strain EJG1473, which was 
grown in media containing selenomethionine and 
Ado-Methionine as described (24). Expressed pro- 
teins were purified by affinity purification on IgG 
sepharose, removal of GST-3C protease, concentration 
of samples and sizing on SuperdexHiLoad 1660 (GE 
Healthcare 17-1069, lOmmx 300mm bed dimension), as 
described (22). 

tRNA purification and EMSA binding assay 

To obtain native yeast tRNA Gln(CUG) , we cloned the 
tQ(CUGJM gene into the leu2-d URA3 vector pYEX4T 
(25), transformed the plasmid into BY4741, grew 
transformants in SD-Ura media overnight, followed by 
overnight growth in SD-Leu-Ura media. We then 
prepared low molecular weight RNA, purified the 
tRNA Gln with biotinylated oligonucleotides oligo HWI 
P257 (Supplementary Table SI), and performed HPLC 
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analysis of modified nucleotides as described (26). 
The ratio of modified to unmodified nucleotides was 
similar to that in strains with tRNA Gln on a lower copy 
plasmid. 

tRNA binding was measured, as described (27) in reac- 
tion mixtures containing Gln4 or its buffer, 2.4 nM 5'- 
[ 32 P]-labeled tRNA, in buffer containing 28 mM HEPES 
(pH 7.5), 80 mM NaCl, 5mM MgCl 2 , 0.5 mM DTT, 
2.5 mM spermidine, 50ug/ml BSA, 20 uM EDTA, 
200ug/ml polyA, 4.6 mM Tris-Cl (pH 7.5), 1 mM 
(3-mercaptoethanol and 10% glycerol. Reactions were 
incubated for 20min on ice and loaded onto prerun 5% 
polyacrylamide gels containing 50 mM Tm-borate, pH 
8.3, lmM EDTA, 5mM MgCl 2 and 5% glycerol, and 
run at 4°C in the same buffer without glycerol. 

In vitro synthesis of tRNA transcripts 

Duplex DNA templates for in vitro transcription of yeast 
tRNA Gln were synthesized from two single-stranded 
oligodeoxynucleotides containing a complementary 
overlap duplex region, as described (28). The two 
3'-terminal deoxynucleotides on the non-coding strand 
incorporated 2'-0-methyl sugars (mU and mG in the se- 
quences), to improve the fidelity of transcription termin- 
ation by T7 RNA polymerase. Milligram quantities of 
each tRNA were transcribed with the Del( 172-1 73) 
variant of T7 RNA polymerase, as described (28,29), 
and purified by denaturing polyacrylamide gel electro- 
phoresis. tRNA was stored at 200 uM in lOmM Tris 
(pH 8.0), 1 mM EDTA (TE buffer). 

Steady state methods 

tRNA Gln transcripts were 32 P-labeled at the 3'-terminal 
internucleotide linkage using the exchange reaction of 
tRNA nucleotidyltransferase (30-32), and purified again 
by gel electrophoresis. Steady state kinetics of tRNA 
aminoacylation reactions were performed in a buffer con- 
sisting of 50 mM Tris-HCl (pH 7.5), lOmM MgCl 2 and 
10 mM p-mercaptoethanol. tRNA was first refolded by 
heating to 85°C in TE buffer for 3min, followed by 
addition of MgCl 2 to lOmM and slow-cooling to 
ambient temperature. Two microliters of aliquots 
from the reactions were added to 5 ul of a quenching 
solution containing 400 mM sodium acetate (pH 5.2) 
and 0.1% SDS, followed by addition of 3-5 ml of 
0.01-0.1 mg/ml PI nuclease (Fluka) to digest the tRNA 
to 5'-phosphorylated nucleosides. The digestion products 
were spotted on PEI-cellulose thin layer chromatography 
(TLC) plates and developed in a solution containing 
100 mM ammonium acetate and 5% acetic acid. Raw 
data were quantified by phosphorimaging analysis, 
and corrected intensities were analyzed to obtain ini- 
tial velocities. K M and V raax were then obtained by 
Michaelis-Menten analysis. ATP (5 mM) was used in 
all reactions; saturation was confirmed for both 
FL-GlnRS and the NTD variant. The glutamine concen- 
trations used to determine AT M <tR for FL-GlnRS 
and Gln4( 187-809) were 10 and 60 mM, respectively; 
saturation was verified in each case. tRNA concentrations 
used were 20 nM-3 uM for FL-GlnRS and 500 nM-20 uM 



for Gln4( 187-809). To determine K M for glutamine, the 
tRNA concentrations used were 1 uM for FL-GlnRS 
and 15 uM for Gln4( 187-809). Enzyme concentrations 
were maintained at least 20-fold below tRNA concentra- 
tions for all experiments to ensure multiple-turnover 
conditions. 

Crystallization and structure determination 

Initial crystallization conditions were identified using a 
high-throughput microbatch-under-oil method (33). 
Crystals appeared after a 6-week incubation at 22°C in 
conditions containing 0.2 ul protein solution (8.9 mg/ml 
protein in 100 mM NaCl, 5% (v/v) glycerol, 2mM DTT, 
0.025% (w/v) NaN 3 , 20mM HEPES buffer, pH 7.5) and 
0.2 ul of precipitant solution (100 mM KC1, 100 mM Tris- 
HCl, pH 8 and 20% (w/v) PEG 4000). Crystals were 
extracted directly from the well and were determined 
suitable for X-ray data collection from initial screening. 
No further optimization took place. 

Remote MAD data collection was carried out at 100 K 
on beamline 1 1-1 of the Stanford Synchrotron Radiation 
Lightsource (SSRL) (34) with a MAR 325 CCD detector. 
To minimize radiation effects, the data collection protocol 
was designed with Best (35) automated within the Weblce 
analysis package (36). Integration, reduction and scaling 
took place with XDS (37). The structure was solved with 
Phenix (38). Using the remote wavelength data set the 
structure was refined through an iterative process 
using Phenix with manual model building with Coot 
(39). Validation was carried out with Molprobity (40). 
The structure was deposited as PDB ID 3TL4. 
Experimental and refinement details are given in 
Supplementary Table S4. Surface charge was calculated 
assuming vacuum electrostatics using PyMol. 

The sequences of several appended NTDs from GlnRS 
sequences of other organisms, listed in Table 2, were 
threaded to the Gln4(l-187) structure using SwissModel 
(41). As a control the reversed sequence was also threaded. 
From the models a Z-score was calculated using 
Prosa2003 (42) with a 20-residue moving window. The 
typical combined, pairwise and surface Z-scores for 
native proteins are (—6 to —12), (—3 to —7.5) and (—3 
to —8), respectively. Alignments were performed using 
the 'fit' function of PyMOL. Due to low sequence 
homology, only carbon alpha atoms were included in 
the alignment. Loops were removed prior to RMS 
deviation calculation. 

RESULTS 

Removal of the NTD impairs Gln4 function in vivo and 
in vitro 

To determine if the NTD is important for the essential 
function of Gln4, we compared the growth of yeast 
strains expressing either full length GLN4 or gln4 
lacking the NTD [gln4(2 11-809)] integrated into the 
chromosome under control of its own promoter, as the 
sole source of ScGlnRS. Growth of the gln4(21 1-809) 
mutant is impaired at 14°C and 19°C, but not at 30°C, 
on both YPD and YP glycerol media, and, as expected, 
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Figure 1. Deletion of the N-terminal domain of GLN4 impairs function. (A) Mutants bearing a gln4 mutation in which amino acids 2-210 are 
deleted are defective in growth at low temperature on YP media containing glucose or glycerol as a carbon source. Serial dilutions of strains with 
either wild-type GLN4 or gln4(2\ 1-809) (marked gln4-AN*) integrated at the ade2 locus in the gln4-AKanR mutant were grown as indicated. 
Indicated strains carry CEN plasmids either with or without GLN4. (B) Mutants bearing a gln4 mutation in which amino acids 2-210 are 
deleted are sensitive to the glutamine synthase inhibitor L-methionine sulfoximine (MSX). 



Table 1. Comparison of steady state kinetic parameters for Gln4 and Gln4 variants 



fccat(s^) *m' RNA (hM) k c JK M lRNA (M-'-s- 1 ) ^ M Gln (mM) k c JK M Gln (M~' -s" 1 ) 



FL-Gln4 1.4 ±0.2 0.19 ± 0.04 7.6 x 10 6 1.7 ±0.2 8.5 x 10 2 

Gln4 (187-809) 1.7 ± 0.3 5.85 ± 0.52 2.9 x 10 5 9.3 ± 0.3 1.8 xlO 2 

PVG-GlnRS 2.8 ±0.6 1.55 ± 0.51 1.8 xlO 6 NA NA 

FL-Gln4 + native tRNA 1.7 ± 0.1 0.14 ± 0.07 1.2 xlO 7 NA NA 



this phenotype is complemented by full length GLN4 
on a single copy plasmid but not by an empty vector 
(Figure 1A). In addition, the gln4(2\ 1-809) mutant is 
much more sensitive than wild-type to L-methionine 
sulfoximine, a highly specific inhibitor of glutamine 
synthase (43), which results in reduced concentrations of 
intracellular glutamine (Figure IB). These observations 
demonstrate that the NTD plays an important role in the 
function of the native yeast enzyme in vivo. 

Steady-state kinetic parameters were measured to 
directly assess the effects of the NTD on tRNA Gln 
aminoacylation. Substantial differences between full 
length Gln4 and Ghi4(l 87-809) were found. For the 
wild-type enzyme, similar ¥^ M tRNA (0.14 uM versus 
0.19 u.M) and k c . At (1.7 s -1 versus 1.4 s -1 ) were measured 
for affinity-purified native tRNA Gln and an unmodified 
transcript, suggesting that post-transcriptional modifica- 
tions do not have significant effects in this system. Using 
unmodified tRNA Gln<CUG) as substrate, we then found 
that Gln4( 187-809) exhibits a 30-fold increase in 
K M ' RNA (from 0.2 uM to 5.8 uM), and a 5.4-fold increase 
in K M C/M (from 1.7 mM to 9.3 mM) although the k CRt 



values are similar (1.4 s versus 1.7 s _1 ) (Table 1). We 
infer that the NTD influences the complementarity of both 
the tRNA and glutamine binding sites for their respective 
substrates, as also suggested by the sensitivity of the 
Gln4(21 1-809) mutant to L-methionine sulfoximide. 

Since the kinetic analysis suggested a role for the NTD 
in tRNA Gln binding, we developed an EMSA assay to 
directly measure binding. We find that yeast Gln4 binds 
tightly and specifically to fully modified t RNA Gln(CUG) 
purified from S. cerevisiae, with ~25nM Gln4 required 
for 50% binding (Figure 2A and B, see Figure 5) while 
>800nM Gln4 is required to bind comparably to 
tRNA phe (Supplementary Figure SI). Remarkably, 
Gln4( 187-809) binds only very weakly at 27 uM, 
1000-fold above the apparent K D of wild-type Gln4 
(Figure 2A and B), and other Gln4 variants Gln4(211- 
809) and Gln4(2 16-809) do not detectably bind 
tRNA Gln(CUG) (Figure 2A). Furthermore, there was no 
improvement in binding of Gln4( 187-809) in the 
presence of other Gln4 substrates including glutamine, 
ATP or the non-hydrolyzable ATP analog AMPPNP 
(Supplementary Figure S2). 
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Figure 2. The N-terminal domain of Gln4 is required for specific 
binding to native tRNA Gln<CUG) . (A) Gln4 variant proteins deleted 
for different amounts of the NTD exhibit reduced tRNA Gln(CUG) 
binding. (B) Gln4( 187-809) protein exhibits detectable binding to 
tRNA Gln<CUG) at high concentrations. 



The Gln4 NTD is structurally similar to two subdomains 
in the amidotransferase that distinguish tRNA Gln from 
tRNA Glu 

To further discern the function of the NTD, we solved 
the structure of the isolated NTD, which behaves as a 
discrete unit to confer function when fused to the E. coli 
GlnRS (19). We purified three NTD variants ending at 
amino acids 187, which spans the region of extensive 
identity between the NTD of GlnRS from multiple 
species (see below), 215 and 228, which covers the entire 
region without extensive homology to E. coli GlnRS. We 
obtained crystals of Gln4(l-187) that diffracted to 2.3 A, 
and solved the structure of a selenomethionine derivative 
purified from a yeast saml-A sam2-A mutant (24) 
(Supplementary Table S4). 

Gln4(l-187) consists of two alpha helical domains, the 
first from residues 1-1 1 1 containing a seven-helix bundle, 
and the second from residues 119-187 containing a 
four-helix bundle, which are connected by a seven 
residue Gi^VGn/JGIT linker (Figure 3A). One face of 
each domain is positively charged across the length of 
the domain, which might facilitate interactions with the 
negatively charged tRNA and provide the basis for the 
non-specific RNA binding activity of this domain (18) 
(Figure 3B). 

Although the NTD lacks sequence homology to any 
available structure, a DALI search (44) of the NTD 
and the individual domains revealed substantial struc- 
tural homology to the helical and tail domains of 
the GatB subunit of GatCAB, the glutamyl-tRNA 



N-terminus B ^ ' J 

\ 




Figure 3. Structure of Gln4(l-187) with comparisons to domains in S. 
aureus GatB (PDB ID: 3IP4). (A) Crystallographic structure of Gln4 
residues 1-187 in cartoon representation. The proposed hinge region 
(Gly 1 i 2 Vali 13 Glyi 14 ) is highlighted together with the likely interacting 
residue Trp 160 , and shown in stick representation. (B) Surface electro- 
static model of Gln4 residues 1-187, shown with two orientations 
rotated by 90° relative to each other, with positively charged residues 
colored blue. (C and D) Structural alignment of helical and tail 
domains of Gln4 NTD and S. aureus GatB (PDB ID: 3IP4) (45). (C) 
The crystal structure of Gln4(l-110) (red) is superposed to the helical 
domain of GatB(295-406) (cyan). (D) The crystal structure of 
Gln4(l 19-178) (red) is superposed on the tail domain of 
GatB(414-475) (cyan). 



admidotransferase, from Staphylococcus aureus (PDB 
ID: 3IP4) (45) and Thermotoga maritima (PDB ID: 
3AL0) (46) (Figure 3C and D; Supplementary Figure 
S3). The seven-helix bundle seen in the NTD yields an 
RMS deviation of 3.75 A using carbon alpha atoms in 
the alpha helices of 5. aureus GatB and 4.01 A when 
compared with T. maritima. However, a five residue inser- 
tion between helix 4 and helix 5 appears to shift the orien- 
tation of the remaining three helices of S. aureus GatB. 
When aligning these three helices separately, an RMS de- 
viation of 1.89 A is observed. The four-helix bundle of the 
C-terminal subdomain of the NTD has an RMS deviation 
of only 1.64 A compared with the S. aureus GatB tail 
domain, and 1.80 A compared with the T. maritima 
GatB tail domain. Since the GatB helical and tail 
domains make specific and non-specific contacts with 
tRNA Gln (46), we infer that the Gln4 NTD has similar 
biochemical function. Furthermore, it is likely that 
GlnRS NTDs from other eukaryotes adopt a similar 
structure, based on threading of these sequences to the 
Gln4(l-187) structure (47) (Table 2). 

The linker between the NTD subdomains is conserved and 
functionally important 

Three observations suggest that the linker that connects 
the two domains in Gln4 plays a crucial role in the tRNA 
binding function of this domain. First, the helical and tail 
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Table 2. Comparison of sequences threaded to the N-term Gln(l-187) structure 
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Figure 4. The linker between the two domains in Gln4(l-187) likely behaves as a hinge, is highly conserved and is important for tRNA binding. 
(A) Structure of Gln4(l-187) (red) superposed on TMGatB (light gray) and SylGatB (dark gray) by alignment of the tail domains. (B) Conservation 
of GlnRS NTD sequences, red->90%; blue->70%, with arrow at Gln4| 87 ., aligned using Multialin (49). (C) Conserved residues are highlighted on 
Gln4(l-187) according to the color code in B with the NTD backbone shown in light grey. (D) Close contacts between W160 of the Gln4 NTD and 
other residues. 
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domains of GatB are also connected by a linker, which 
appears to function as a flexible hinge that closes upon 
tRNA binding, based on differences in the orientation of 
the domains in the tRNA-bound (T. maritima) and 
tRNA-free (S. aureus) structures (45,46). In this regard, 
we note that the domains in the Gln4 NTD are oriented at 
an angle between that of the T. maritima tRNA-bound 
GatB and the 5. aureus tRNA-free GatB (Figure 4A). 
Second, although the linker sequences in GlnRS differ 
from the sequences in GatB, the linker sequences in 
GlnRS are among the most highly conserved amino 
acids in the Gln4 NTD family (Figure 4B). In a compari- 
son of highly divergent eukaryotes, although neither the 
length nor the sequence of the N-terminal domain is 
highly conserved, three of the seven amino acids in the 
linker region G112V113G114 are nearly 100% conserved 
(Figure 4B and C). Furthermore, Gn 2 appears to 
interact with Wi6o> 1 of the 10 other highly conserved 
residues in the NTD; the alpha carbon of Gl 12 is in van 
der Waals contact with C9 of W 160 (Figures 3 A and 4C 
and D). Third, G114 is predicted to be a hinge residue, 
acting as a flexible connector of the two domains, based 
on an elastic network analysis with the program 
HingeProt (48). 

Since the G112V113G114 residues of the linker are highly 
conserved, and since hinges frequently mediate conform- 
ational changes upon ligand binding (50), we considered it 
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Figure 5. Mutations in conserved amino acids in the putative hinge 
of the NTD affect the interaction of Gln4 with native tRNA Gln(cutf) . 
(A and B) EMSA wild-type and mutant Gln4 proteins (23-2017 nM). 
(C) Binding as a function of Gln4 protein concentration. 



likely that mutations in the linker region would impair 
function. Thus, we purified variant proteins in which 
G112V113G114 was replaced with AAA and with PVG 
and in which Wi 60 was replaced with F or A, and 
measur ed tRNA Gfn(CUG) binding. Although the variant 
proteins all bind tRNA Gln(CUG) , as measured by reduced 
mobility of the tRNA, all of the mutant proteins exhibit 
defects in binding (Figure 5A and B). Three variants 
(Gln4-A 112 A 113 A 114 , Gln4-G„ 2 P, Gln4-W 160 A) fail to 
form stable complexes with tRNA Gln(CUG) , as judged by 
lack of comigration of the complexed tRNA with that 
formed by wild-type Gln4, and all four variant proteins 
exhibit an apparently reduced affinity for tRNA <CUG) , 
requiring 4-12 times more protein than the wild-type to 
bind comparable amounts of tRNA (Figure 5C). 
Moreover, the Gln4-Gn 2 P variant exhibits a 10-fold 
increase in the K M ' RNA (from 0.19 uM to 1.6 uM) as well 
as a slight increase in /c cat (1.4 s -1 versus 2.8 s -1 ) (Table 1). 
Thus, we conclude that the linker region is important for 
binding, and speculate that it acts as a hinge facilitating 
closure between the helical and tail domains upon tRNA 
binding. 



DISCUSSION 

The observations that the NTD of S. cerevisiae GlnRS 
bears a substantial structural resemblance to two 
domains of the bacterial GatB amidotransferase that 
distinguish tRNA Gln from tRNA Glu , and that the 
NTD also participates in tRNA Gln binding, imply that 
there is a connection between the indirect pathways for 
formation of Gln-tRNA Gln in bacteria and archaea, and 
the direct pathway that evolved in eukaryotes. Since it 
is thought that tRNA Gln was present in the last univer- 
sal common ancestor, it has been puzzling that amino- 
acylation of this tRNA is achieved by different routes in 
each of the three kingdoms. Sheppard and Soil proposed 
that both GatCAB and GatDE were present prior to the 
split between archaea and bacteria (51), while the specific 
GlnRS evolved in eukaryotes. We propose that the 
tRNA Gln recognition domain from an amidotransferase 
was most likely conscripted as an NTD to a progenitor 
non-discriminating GluRS, and thus played an inte- 
gral part in the development of the eukaryotic GlnRS 
family. In particular, evolution of GlnRS from an 
early non-discriminating GluRS required selectivity deter- 
minants in favor of tRNA Gln to evolve, while nega- 
tive determinants against tRNA Glu would also appear. 
The proximity of the NTD to the tRNA-synthetase 
core domain suggests that eukaryotes may have ex- 



ploited the NTD domain to provide subtle structural 
discrimination between tRNA and tRNA Glu prior to 
the appearance of discriminatory residues in other 
synthetase domains conserved between eukaryotes and 
bacteria. 

In support of this, we find evidence that the NTD of 
GlnRS likely existed in the common eukaryotic ances- 
tor, based on comparative genomic reconstruction of the 
Gln4 family (52). Thus, GlnRS proteins from highly 
diverse, free living eukaryotes, spanning lineages 
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from the ancient JEH and POD clades through more 
recent clades (including Plantae, Amoebozoans and 
Opisthokonts) share a recognizably homologous, but 
diverse, NTD of 210-259 amino acids (Figure 4B and 
Supplementary Figure S4). Curiously, we and others (53) 
have also found that the appended domain is absent in 
some eukaryotes, including parasitic protozoa such as 
Trypanosoma brucei and Leishmania major, as well as 
the Eurotiomycetidae, Trichocomaceae fungi. There also 
appears to be a correlation between the presence of the 
appended domain and the use of U 73 as the discriminator 
base (Supplementary Figure S5). Thus, although an 
appended domain is not required to construct a specific 
GlnRS, such a domain was likely a part of the specific 
GlnRS in the eukaryotic common ancestor and may 
have played a crucial role in the development of a 
specific GlnRS. 

Our findings also point to a parallel between the 
appended domains in eukaryotic GlnRS proteins and in 
GlnRS in the bacterium D. radiodurans (14), even though 
the eukaryotic domains are located on the N terminus, 
upstream of the conserved core, while the appended 
domain of the D. radiodurans GlnRS is on the C 
terminus, downstream of the conserved core. Although 
the Gln4 NTD and the D. radiodurans GlnRS CTD have 
no significant sequence similarity (14), and are at opposite 
termini, it is likely that the D. radiodurans GlnRS CTD, 
like the Gln4 NTD, is structurally related to GatB, 
because the CTD has weak sequence homology with 
regions of GatB, and cross reacts with GatB antibody (14). 
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